CDS

Accession Number TCMCG052C15576
gbkey CDS
Protein Id CAB4276362.1
Location complement(join(4176935..4177041,4177100..4177229,4177542..4177604,4177689..4177727,4178626..4178709,4178904..4178948,4179214..4179290,4179472..4179640,4179789..4179893,4180436..4180573,4180842..4180961,4181208..4181609))
Organism Prunus armeniaca
locus_tag CURHAP_LOCUS25460
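
The Location field above uses the GenBank/EMBL feature-location syntax: `join(...)` lists the exon intervals (1-based, inclusive) and `complement(...)` places the feature on the reverse strand. As a minimal sketch, the snippet below parses that string and checks that the summed exon length matches the 492 aa protein plus one stop codon. The helper names `parse_location` and `cds_length` are hypothetical and used only for illustration; the regex handles only the simple `start..end` intervals seen in this record.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(4176935..4177041,4177100..4177229,4177542..4177604,"
    "4177689..4177727,4178626..4178709,4178904..4178948,4179214..4179290,"
    "4179472..4179640,4179789..4179893,4180436..4180573,4180842..4180961,"
    "4181208..4181609))"
)


def parse_location(location):
    """Return (strand, exons) for a simple complement/join location string.

    Hypothetical helper: it only understands plain 'start..end' intervals.
    """
    strand = "-" if location.startswith("complement(") else "+"
    exons = [(int(start), int(end))
             for start, end in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, exons


def cds_length(exons):
    """Total coding length in nucleotides; coordinates are inclusive."""
    return sum(end - start + 1 for start, end in exons)


strand, exons = parse_location(LOCATION)
total = cds_length(exons)
print(strand, len(exons), total)
```

A length that is a multiple of three and equal to 3 x (protein length + 1) is a quick consistency check between the Location field and the Length field of the Protein section.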

Protein

Length 492aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJEB37669, BioSample:SAMEA6812185
db_source embl accession CAEKDK010000004.1
Definition unnamed protein product [Prunus armeniaca]
Locus_tag CURHAP_LOCUS25460

EGGNOG-MAPPER Annotation

COG_category M
Description tail specific protease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko01000
ko01002
KEGG_ko ko:K03797
EC 3.4.21.102
KEGG_Pathway -
GOs GO:0003674
GO:0003824
GO:0004175
GO:0005575
GO:0005622
GO:0005623
GO:0006508
GO:0006807
GO:0008150
GO:0008152
GO:0008233
GO:0009579
GO:0016787
GO:0019538
GO:0031977
GO:0043170
GO:0044238
GO:0044424
GO:0044436
GO:0044464
GO:0070011
GO:0071704
GO:0140096
GO:1901564

Sequence

CDS:  
ATGAGGGTTTTGTTGCTCAGCAACACCACCACACTCTCACTATCTTCATTACCACCACCGCGAACCCCAAAATCCCCAATTCGATCCAATTTCAGACGCAATTCAATCAATTGGGCCGAGAAAGCTCTAATTGGAGCCCTAGGTGGTGCTCTGTCGTTCGGTCTTTTGTTCTCTTCGCCCTCTTCTTCCATAGCGATTGAGTTTTCTTCTTCTTCCTCCGTTCAAGTCCAACCTTCTTCGCAGCCACCGGAGTTTTGCAGCGAAGATGAAGGAGATGAAACGGCCGAGTTAGGGTCCGAACCGGCCGTGACCAGTGAGGGAGTTGTGGAGGAAGCTTGGGAAATCGTCAATGACAGCTTTCTCAACACCAGTGGCTCTCGCTCGTTCCCGGAAACTTGGCAGAGGAAAAAGGAAGACATAAGGAGTAGTTCAATAAAGACAAGATCAAAGGCTCATGATACGATTAAGCGAATGTTGGCCAGCTTGGGTGACCCTTATACCCGATTTCTTTCGCCTGAAGAGTTCTCCAAGATGGCGAGGTATGACATGAGTGGTATTGGAATAAACCTCAGGGAAGTTCCAGATGACAATGGAGATGTGAAATTGAAGGTTCTAGGACTTGTATTAGATGGCCCTGCACATTCTGCTGGTGTGAGACAGGGGGATGAAGTACTAGCTGTTAATGGATTGGATGTGAAGGGGAAATCAGCCTTTGAAGTATCATCGATGATGCAAGGTCCTAACGAAACTTTCGTTACTATTAAGGTCAAGCATGGAAATTGTGGGCCTATTCAATCTATTGAAGTCCAAAGACAACTTGTTGCTCGAACCCCTGTCTCTTATCGGTTGGAACAAATAGAAAATGGAACCAGATCTGTTGGATACACTCGCGTAAAAGAGTTCAATGCATTGGCTAGAAAAGACTTGGTAACTGCTATGAAGCGACTTCAGGACATGGGTGCATCATACTTTATTCTGGATCTTAGAGATAATCGTGGTGGACTAGTACAGGCTGGAATAGAAATTGCCAAGCTATTTTTGAATGAAGGGGAGACGGTGATTTATACTGATGGGAAGGTTCCCGAATACCAACAAAGTATCGTTGCAGATACTGCACCATTAGTTACAGCTCCTGTTATCGTTTTGGTGAACAACAATACTGCTAGTGCTAGTGAAATTGTTGCTTCAGCTTTGCATGATAATTGTAGAGGTGTTCTTGTTGGTGAACGGACATTTGGCAAGGGTTTGATTCAATCCGTGTTTGAACTTCGTGATGGCTCTGGTGTGGTTGTAACTGTTGGGAAGTATGTTACGCCAAAACATAAGGACATAAATGGCAATGGAATAGAGCCTGATTATCGAAATTTCCCAGGATCTCTTTATCCGCAGCTTGGAGTGACGTCACACAACATCTTTCACAGTGTAATATGCTTCAGCGGGGATAGATCAGTGGTTGACACAGTTCTTTCTTACCTTTGA
Protein:  
MRVLLLSNTTTLSLSSLPPPRTPKSPIRSNFRRNSINWAEKALIGALGGALSFGLLFSSPSSSIAIEFSSSSSVQVQPSSQPPEFCSEDEGDETAELGSEPAVTSEGVVEEAWEIVNDSFLNTSGSRSFPETWQRKKEDIRSSSIKTRSKAHDTIKRMLASLGDPYTRFLSPEEFSKMARYDMSGIGINLREVPDDNGDVKLKVLGLVLDGPAHSAGVRQGDEVLAVNGLDVKGKSAFEVSSMMQGPNETFVTIKVKHGNCGPIQSIEVQRQLVARTPVSYRLEQIENGTRSVGYTRVKEFNALARKDLVTAMKRLQDMGASYFILDLRDNRGGLVQAGIEIAKLFLNEGETVIYTDGKVPEYQQSIVADTAPLVTAPVIVLVNNNTASASEIVASALHDNCRGVLVGERTFGKGLIQSVFELRDGSGVVVTVGKYVTPKHKDINGNGIEPDYRNFPGSLYPQLGVTSHNIFHSVICFSGDRSVVDTVLSYL
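
The Protein sequence is the conceptual translation of the CDS sequence. The sketch below shows how that relationship can be checked, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear plant gene but is not stated in the record. The `translate` function is a hypothetical helper, and the short inputs are the first and last codons of the CDS above rather than the full sequence.

```python
# Standard genetic code (NCBI table 1), laid out in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons and last four codons of the CDS in this record.
print(translate("ATGAGGGTTTTGTTGCTC"))
print(translate("TCTTACCTTTGA"))
```

Passing the full CDS string to `translate` and comparing the result with the Protein field would extend the same check to the whole record.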